Non-Inclusion Property in Multi-Level Caches Revisited
نویسندگان
چکیده
The center of gravity of computer architecture is moving toward memory systems. Barring breakthrough microarchitectural techniques to move processor performance to higher levels, we are likely to see more and more on-chip cache memory in the years to come. Three levels of on-chip cache memory is not uncommon in recent designs. The increasing reliance on more on-chip cache requires more sophisticated cache design techniques and perhaps a rethinking of some cache concepts. We believe that a prime candidate for these concepts is the inclusion property. While simplifying memory coherence protocols in multiprocessor systems, this property makes inefficient use of cache memory real estate on the chip due to duplication of data on multiple levels of cache. Furthermore, strict enforcement of the inclusion property implies a “ripple effect” during updates where a single update in a cache level may result in several updates in the hierarchy above. In this paper, we discuss several non-inclusion cache strategies for future cache systems. We present some design alternatives for non-inclusive cache architectures. We show that the main advantage of a non-inclusive cache design arises from its relatively high level 2 (L2) hit rate, which enhances the overall average memory system access time. Non inclusive cache results in reducing L2 miss rate by 28 percent for specINT and 40 percent for specFP.
منابع مشابه
Dswitch: Write-aware Dynamic Inclusion Property Switching for Emerging Asymmetric Memory Technologies
Emerging non-volatile memory (NVM) technologies, such as spin-transfer torque RAM (STT-RAM), are attractive options for replacing or augmenting SRAM in implementing last-level caches (LLCs). However, the asymmetric read/write energy and latency associated with NVM introduces new challenges in designing caches where, in contrast to SRAM, dynamic energy from write operations can be responsible fo...
متن کاملWCET analysis of instruction cache hierarchies 1
With the advent of increasingly complex hardware in real-time embedded systems (processors with performance enhancing features such as pipelines, caches, multiple cores), most embedded processors use a hierarchy of caches. While much research has been devoted to the prediction of Worst-Case Execution Times (WCETs) in the presence of a single level of cache (instruction caches, data caches, impa...
متن کاملAn Analysis of Cache Sharing in Chip Multiprocessors
We present the effects of L1 and L2 cache sharing on cache miss rates, cache line invalidations, and constuctive and destructive interference. The most important finding of this paper is that a system configuration that shares L2 caches, does not share L1 caches, and does not enforce inclusion between the L1 and L2 caches will produce the highest performance cache and communication hierarchy fo...
متن کاملTiming Predictions for Multi-Level Caches
This paper describes a framework to extend timing predictions of real-time applications from architectures with primary caches to architectures with multi-level caches. First, general problems of extending timing analysis frameworks to multi-level memory hierarchies are discussed, including a method for dealing with uni-ed caches. Second, an existing data-ow framework for cache analysis, the fr...
متن کاملA Theory of Reads and Writes for Multi-level Caches
A wide range of program analysis and optimization techniques rely on the knowledge of the programs cache behavior. Thus soundness and precision of the underlying cache behavior model is important for these client analyses. Soundness and precision of the cache behavior models are becoming even more important with the recent advent of multi-core and many-core architectures due to two reasons. Fir...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- I. J. Comput. Appl.
دوره 14 شماره
صفحات -
تاریخ انتشار 2007